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(57) Abstract 

PROBLEM TO BE SOLVED: To obtain a new promoter 
comprising a DNA derived from the genus Aspergillus 
having a specific base sequence and allowing a mold 
belonging to the genus Aspergillus transfomied by a 
vector comprising the DNA sequence to produce a foreign 
protein. 

SOLUTION: This new promoter is shown by a base 
sequence of, e.g. fonnula I or II and derived from 
Aspergillus oryzae belonging to the genus Aspergillus, 
and a foreign protein may be produced by a mold 
belonging to the genus Aspergillus transformed by a 
vector comprising the promoter sequence. The promoter 
is obtained by extracting the whole DNA from Aspergillus 
oryzae grown in the presence of glucose, passing it 
through an oligo(dT)cellulose column to separate mRNA 
therefrom, synthesizing cDNA by using it, conventionally 
forming a cDNA library, and cloning it so as to search a 
gene highly expressed, thus obtaining the resultant 
promoter comprising e.g. an alcohol dehydrogenase gene 
strongly expressed in the presence of glucose. 
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(54) mm<o4im T7.^)v^)V7.mmm(o»mi3i:/a^-i^- 



(57) [M»5J 

T;?^^/^^/]^;;^ (Aspergillus) JR 

n^r*-?-— ^'I (alcoholdehydrogenase I) J^fiT*^ =i>^ 
H/Jf/Wl: V / -'^>lU:t^iy^--^ (coproporphyri 
nogen III oxidase) itCT". ^^V--:^ • h^^^y^^^ 
— (hexose transporter) t: y?. h >'H4. 1 (h 

iston H4. 1) ite^x 40Sy 7}^V^-A®eKS17 (40S ri 
bosomal protein S17)ite^^Xmosy ^^y'-i^^SSSS 
28 (40Sribosomal protein S28) itfs^ 



I AbeiMfls 



£b»8 



PDlyAtatI 



L «yyz» »cw pdWA 

o 



1 

fc Kay-;?-— if I (alcohol dehydrogenase I) 

3:/'n7K/Wt: >r y /^^>'ni:^^V^— (copro 
porphyrinogen III oxidase) ^^^f^y— • h 

(hexose transporter) iSfi^^ \:: 7s h 

\ (histon H4.1) ste^. 4osy TKy'-Asegsi 

7 (40S ribosomal protein SI7) it^2^^ 40Sy/1^'y'— 
i^SeKS28 (40S ribosomal protein S28) ltL&^(^i^ 

[0 0 0 1] 

imm(om'r^mm] ^^m\t^ r^^yi-^/i^:^ (Aspe 
rgiiius) m^^^m<Dmm.y'^'^-^-\^mir^o mi- 

'^'5fcii6(Cie::.i|/^DNA»fjt€:*-r^:7^'7;^5: Fi> ^ 
10 0 0 2] 

mmmmm^^vx\^^^o 
[0 0 0 31 zthh<omAi^^9. M^mirStmm&n 

(omi±mm^±t i^xmrn-r^^tfimm^thx^^^fc, 

[0 0 0 4] ^(D^mi)^b. ^mm\^^^'f^^titi:yr2 



(2) 1 1 -2 7 6 1 7 0 

2 

S3le(7>i//i-=ir ^ 7— ^sfieT-<^>^n^— [Bi 
otechnology, 5, 368 (1987) ] , h y zi^/w-^ . y — 

^- [Biotechnology, 7, 596 (1989) ] . T y^^Ji^^ 
• ;^y-^*j|eCOa-r ^ ^-ifite^^^^^t^'^-^?^ 
— [Biotechnology, 6,1419 (1988) ] (Itr?! 8362-27 29 
88) . r:^^)\^^}'^:^ • K^V'^^S^f^^^T/t-:^!— 
t Kn^-:f— ^ I iftfi-^^cDT^n^— [Biotechnolog 
10 y, 5, 713 (1987) ] , T ;^-</U^y^;^ • :^y T 

zn>< h 7='t Kn^-^— if (GAP-DH) C0:/n^— — 
*tJffiLrSSSeH<?5iSig (1#ga¥3-187392) (DS^/^ 

[0 0 0 51 

;^ • ;^-yifft3l^(DffM^^^i^^--i5'--^#. ro^^^ 
20 [0 0 0 6] 

^n^^mir^^.xmm.^j^h<^xh^o 

[0 0 0 7] *^M(-*5v^r(^. WTcoJ: 5{-LT±IB 

rxm<^m:r^ikB^<7)m^tmnm. ^^T<D^o^^ 

30 LTT*#5o 

[0 0 0 81 ^iiKm^^/^='-^'^^-<^^*Mt Vti 

m-t^^ rO/Ky A^+a^RNA^fflV^TcDNA^'^fiSc 

m'^^J:^^^-i:^^^XcDNAy^:fy])—^m 

[0 0 0 9] r<7>^^:/^ y-'CDtt^^^^^iiLfc^ n — 
l^^mi^^^C^Wc. 0iJx.«5OO<@jSS'J (^?tF8,000xteT-T 

If fi!c^?ttTv>-5a»Tfcnfi«^500fii?ift^T*t ^ 

X.e>tl6o ) T"^;^?: KDNAS:|ftaLT«^800bp 
40 cOi^Sia^'iSrftlWi-^o 

[0 0 101 ::^LXnbtlfcm.mMmt. ^cDNA 

(7)=i-Ymi^^5' i^^mfgt(ommsm^^^. dna 
m¥\mxm^^fc^x^miiip.m^n^:^t{::.x^. 

-So 

[00 11] m^. z(Dm\^Lxm'^\^rz^y^=^-^^ 

50 ffiTT*3S<5la.bTV^S^a^S^cr>:/^^-^-(D*gS 



CR (Polymerase chain reaction) S:ffll''*S:^jfe(^i 

[0 0 12] pcR^m^^i>:)J^<o—mt\^x. £tT« 

ttWLTE c oRV. Scal.Dral.PvuII 
fc5V^I4S s p lifoe b p$r^>!SLT¥?t*»ffil-W»f 

[00 13] ^(Dmm^r^y^-am^^^i^f^^y 

DNA»f>i'^»at b'Cv c DNAO:feSi£^Jd^b-&fiK 
Lfc:^7 -^-SS ±1^7- ^7°^' -(^-^^iSS^^^ia?']* 

$t^i<tJ)T-5l^LT 1~2 k b pgScOft^^^-rSDN 
AWfK-5r«^^»(^J:oT*g|-»KL. ig^/<C^D- 
:^ ^ ^ - {;ijg^ L T (r n V ^-r 5 o 
[0 0 141 |S<@»3fe:S:Lfc;J'»3— i^d^^^^^S K«r 
flSUT^' C3— =i:^:/LfcDNA{^>i-Oi^*ffi(^)^Sia 
?lJ^fi?Wb, c DNAt+ai^/iiiSia^J^WLc DNA 
(J^ioSlcffiS-rSDNAKrK-^-W c DNASjfeWjt^s 

[0 0 151 ^a^-^'-®i®SrW-r5DNA»f>i-w5' 
aiRBI*^^ KV^tpigSl k b pgSK±?r*-r'5 
DNA»f>T-SriiSiJL, ^SSia?«J?:*Sb> ^/i^n-^ 

I 0 0 1 6 1 ^— ^ -(i±aiC75 J; 9 I- t T 

0ijx.fi. 0.1% SDS (eCC, 0. 3mol NaCl, 0.03M 

[0 0 17] ^^^eq<07°n^— ^— tt, -tc^Titt-x if 
[0018] Iti^r LTfi, T;^'~«/^^/^;^ • ;^!;if Sr 
[0 0 191 •roWffl^' V^^^'® 

■fe KSit^ii'toR^, m^rss^bise^^. :^!; =f' 



(3) ■ 1 1-2 7 6 1 7 0 

4 

[00201 «±CDfl^®feglt i t|s:<2t*D<0*ifeT-fT 9 ^ 

(002 11 eiT. *liSeiJSr#RSb/j:;iSb*IIW5rPi(Sl 
10 [00 2 21 

(1) a^^<^Kt# 

T-y^^/l^df/W^^ • (Aspergillus oryzae) ATCC 4 

2149«:iJAT©««*ft=-C'*«I-^^o YPD^lfe (^»^ 
^;^1%, ;jfr}-<7°h>'2%Rt/^/^='-:^2%J:>!)?'i 
(pH6.0) |i*et*^JKaL. 30'C(C*3V^T22»# 

[0 0 2 3] (2) c DNA7-Y::^7 y — COf^M 

'^iZ-f (chigwin) C/W •^'r^:^V')~- (B 

iochemistry) . 18#, 5294H (1979) ) {c:ti§oT4.6mg 
co^RNASr#fc„ ^V^ (dT) ir/na-^^ • 

(77/^■^VTti:) J-^L. l. 4ing<Z)^RN Aii^ 
e>30// gWJK y AWJPRN APi^)^Sr#fco 
[0 0 2 41 rcDJi^y AftiPRNASrfflV^T, No t I 
SlJPSffi^^^WSPffi^r^tf ^7 (5' -TTCTAGAATTCAG 
CGGCCGaTTTTTTTTTTTTTTTTTTTTTTTTTTTTTVN-3' : r^-C 
30 \m,C,G(Dm-^. NttA,C,G.T©ig'^Sr«-r) SmfflLVD 
f.^^^;;?.^ iJ7"^r— if (RNaseH-) (i? n Vt" -jy 

ibliE.coli DNA polymerase I n:>'T-yi^) v E. col 
i DNA ligase (^'a:xx5'^') iSitJfE.coll RNase H 

c DNAt LfCo 

[0 0 2 5] c DNA^^«lffl;»7i>. (i'Ol^T-y^) ^ 

fflv^^cy/^^Ilia^::J:'?M^cDNA^rl^*Lytm, Ec 

0 R 1 mmmmmmu&:^-i;tiriify^~ (s'-aattcgg 

40 CACGAGG-3' S:U«5' -CCTCGTGCCG-3' ) ^liSSL/'rio 

[0 0 2 6] r 9 L-C#ibHfc:cDNAWr>i-«. Not 

1 mmmmx'^±mit u^cm, e c © r i *s jco^n o t 

{Sl^^^ftOc DNA^^tf^cB&ScDNAy 

(0 0 2 71 (3) susi-rsiseT-ro^^ 

5 0 0<BSBIJL. y^y:^^ KDNA^ttmLT. Ml 3 
50 =L=^y<—i)-/U-:;^yy(-v— (5' -CACGACGTTGTAAAACGAC- 



(4) 



^mW-l 1 -2 7 6 1 7 0 



3') . Taq DNA/Ky/y— (/>?>^:^ V ^n/U-e — 
-a) St/^DNAV-^^3l>--^^- (LI-CORtt. ^ 

[0 0 2 8] r 9 UT#^,nfcJfiSia3^J(i. ^cDNA 
*ffi®*S(Z)iaSiE?IJS:'^A/TV^5o GenBank D 



(0 0 2 9J Sequencher (GeneCodes^t) ^ffl 

[00 3 0] 



Gene naie 


Score 


Source 


Alcohol dehydrogenase I 


11 


Af 


Coproporphyrinoffen III oxidase 


4 


So 


Hexose transporter 


5 


Sc 


Hlston H4.1 


4 


An 


40S RibosoBal protein S17 


2 


Sc 


40S Eibososal protein S28 


2 


Sc 



[0 0 3 1 ] Sif'difc^l/^Tti, &.T(OX 0 (^^-To 
Score : ^ :x^At;i3lA//c600(DcDNAgfi?iJftWT'COffi^ 



Source :GenBanky=— ;^ l-^J-t'-S a ^J^"^^ 

Sc=Saccharorayces cerevisiae 
Af=Aspergillus flavus 
An=Aspergillus nidulans 

10 0 3 2] ^(D^^^ V/\^^-::^^^TXm<^^\. 
TV^'Siie^t \^XT/\^^—/\^ ' "fM ]^u^'r—^l (a 30 
Icohol dehydrogenase I) iHS^^ ::i':fn7^;V\i ]) / 
^y>lll:^^iyy-—^ (coproporphyrinogen III oxi 
dase) it'g^, ^^y — :^' h^V-X?}^— (hexose 

transporter) M&^. tP^h>'H4. 1 (histon H4. 1) 
MG^. 40Sy 7j?>^— ASe5ttS17 (408 ribosomal prot 
ein S17) itin^. 40SU /I^V— i:^^fiHS28 (40S ribos 
omal protein S28) M&^^$ifz{Zm^Ltio 

[0 0 3 3] mmm2 T/1^=^-/1- - xt: Kpy':^— 



Aspergillus oryzae RIB-40^ (ATCC 42149) fi^h'T ^ 
^ t/^— K (Timberlake and Bernard) <0 

:^m^v^'^xmc^y j^DNA^. eh p^tmLxw- 
m^m:immi-^EcoKvx^±mit (p) 5' -acctgccc- 

3' (NH2) id -CTAATACGACTCACTATAGGGCTCGAGCGGCCGC 
CCGGGCAGGT-3' fi^htS:hT ^^f^ - b fz.^ 
[0 0 3 4] rco^y i^DNA»f;T**^M<t L. ir^/;^ 

i^xr^-^^^-co-^m^'Amm^m-r 

5 4-^ — AD (5' -CTAATACGACTCACTATAGGGC-3* ) 
T>^^±:^:^y9-f'^—tLXr/i^=i—/U'y^\ih* 

ny-^^if I c DNA(Dmmmm^b^^Lfc^y-f 
# 1 (5' -ACGAGGGGCATCTTCACTGGGAGG-3' ) Srffi V ^T 

PCR^frofrio PCRS^Sti, Expand HF 

;3tf— • •^:^/N^i^tt) S^fflV^, DNA1^— r/l^ • f-^:?^ 
(/>-^>^ • :3i/w--a) |;i J: 19 fro ;rco 

[0 0 3 6] 

(j!*lfi«) 



[10 X] React ion buffer 

dNTPs, Mix 10 mM 

irv;'^7'y-f (/ 7>f7-AD) 

T>'f"^>:^y"y^ -^ — (f y^-r-ni) 

* template (DNA 0. 2//g) 

Expand HF DNA/K y ;^ y —if ^ ^ :^ 



20. 25 
2.5 
0.5 
0. 25 
0. 25 
1 

0. 25 
25 



1 

fi 1 

H 1 

M I 

// 1 

U 1 

/i 1 



[1 X] 
200 //M 
5 fiM 
5 mM 

1. 25 u /TEST 



*EcoRV^g?LAc:DNA^0. 2//g/10// 1 J: ^ {CT EX t ^^^Lfzi^cO 

[0 0 3 6] ±m(ORmim5/i l ^0.2mlR/£^^- r(OX^f^m.!^m'^KJ:y):^Tiy:fy^: 

-f^XU^LXDNA Thermal Cycler i::-ir:y h 50 tCo 



7 

[0 0 3 71 

sou? 74t:. 15^^ 70X:. sf-^^/i^ 

95^. 30# 66t:. 15^^^ 70^. 35> 31^-f ^/l- 
95t:, 30# 62t:. 15# 70*C, 3^ 31^^ ^ /I- 
95^:. 308^^ 58t:. 15# 70t:, S:^' 31^><:5^/^ 
95^. 30# 54t:. 15#^ 70T:, 35> 20-9-^^/1^ 

[0 0 3 81 iii|gj£^^1.5%7';^n-;^ . ^/l^mUi^ 
mx*mW^LX2,0kb(DDl^Am>T'i:"^tio r<^DNA 10 

— — pCR I I (^>^l:*hni^3:: W±) 
[0 0 3 91 ^(D^i:^—i^ib^h:f^::^^ K^PMLT^ 

1829bp<^ D N A®*S ^ L fco 

[0 0 4 01 r<^>^t{;iLT. 1829bp(Dr/^=i— • r'e 

[0 0 4 11 'mmms =3^n^;Vt:^ y 7-^:^111 
:^dirix^-'^(^c DNA(;iji>rjS'r5<^):?'p^-^-M«£ 

^l^td, 6 b p ^|f.ig^UT¥?&*iffii-^»f-r^*JPfi»* 
t LTD r a I ^^V^fCo 

[0 0 4 21 T^^^ir^y^y^^^-^—i: LT. =3 30 

SSa^^J^^^b'g'^Lfciy^^-f -^-ft 2 (5* -OATCGCTTTTCCG 
CTGAGTATCTG-3' ) ^rfflV^fCo ^ 9 bT#e>ttyhl887bp(^ 

[0 0 4 31 ^^^4 ^^y-::^' 

=3f/Uy^:^yif(Oy/ ADNA^^^r^fb-r^l^Je:. 6 b 40 

p irtmL^x^m^t^icmm-t^mmmmt lte c o 

(0 0 4 41 T>'^-t>';^:/^-f -^--i: bx. 

^ Lfc -e— # 3 (5' -CTGGGGTAGGAGTATCGGAGTGCT- 

3*) ^m^^tCo C 5> LT»^ttfc2091bp(^)^^y-;^ • 

[0 0 4 51 ^teW 5 t: >^ h >-H4. 169cDNA{;i^'r5 
P^-^-^^c^)St# 50 



#M¥1 1-276170 

8 

y i^DNA^^^r^lk-rai^j;!. 6bp^^vaLT¥if 
7kffil^93»f-r6aRB^*t UTS s p I ^fflv^fco 
[0 0 4 61 Tv^-fe:^^;^:^^?-^ -^-i LT. t 

>^ h:/H4. Ic DNAO:eSi^3?»J;5^?J^figU^c7'^>r'^— 
#4 (5' -AGGTTGAGTGGGAAAATGTGGAGA-3' ) ^rfflV^fco - 
b LT#^*xfcl915bp(^t:^ h ^/H4. lififi^^^^ST^n 

[0 0 4 7J HtSWe 40Sy /-K>^-ASag$17<7>cDNA 

*ifeW2 t|iHi{-LT. 40sy /l^V-i^seKsnitfeT- 
;^y•^C9<5^yi^DNASr^^?mt:•r§^^-. 6b pS:^ 
^LT¥m*«l-§JWr-r^Wlfiffi^*i: LTE c o RVS: 

[0 0 4 81 r>^^ir>^;^:y?>r-e-t LT. 40 

sy Jpy— i^Se!SS17 c DNACOiSSiE?iJd^b^^b 

-f # 5 (5' -<;CTTGGGGTAGTAACGCTCAATGA-3' ) 
?rfflv^/S:o 5 LT#fe>ixfci569bpco40sy jJ^y'— i^g 

: 5 {C7Si"o 

[0 0 4 9] ^ifegiJ 7 40S y 7Ky^-i^^agS28(^>cDNA 

'mm^hwmz-x^x, 4osy 7^^y^-i^geRs28^teT- 

:^y-^<^yy ADNA5:^^?^ffc-r5I^J-. 6bp^|^> 
iaLT^y&*«{c9)»f-r5««|SB^*i: LTS s p I ^ffl 

[0 0 5 0] *;rc. T>'^ir>':?^y^^>r-^— t LT. 40 
sy /Ky^— ASeKS28 c DNA<7)J&SiB^J;0^e5'a^fi£L 

y" ^ # 6 (5' -AAGCACCACCGAAGGGAGAGGACT-3' ) 

^fflV^fCo r 0 LT#e>n/c:1631bp(D40Sy /-Ky'-.^fi 

# : 6 XZ.Tf^'t. 
[0 0 5 11 ^J^^'JS T/l^::^-/l^ ' Vxi^^-M. 

(1) :^HiSS5Ri3-i//v^n::::i5/'— ^iteT-^a:^'fe^5' 

»J£<Jij2tr5t#L:^::*^2.0 kb(Z)DNA»f>T'^^M{^LT. P 

-i$'-S:'&t>1.0 kb<7>DNAWrJtS:#fcp PCRRlSlwfflv^fc 

ir>';^y'y-f -v— : 5' -gtcgacgaactattacggagtacatagc- 
3* 

T y V — : 5' -GTCGACTTTGACTTTGGGATC 

TTGTGTT-3' 

[0 0 5 21 r(^>»r>t^ffiiPg^3NSal I :kfli® 



(6) 



^mW-l 1-2 7 6 1 7 0 



10 



J275 (4.5 kb. ^uyy'y^m cr^SallgUffitCljf AU. 

m^:^\^f)m:)j\^i^^7^^tirz^y:^K KpBRJAP (5.5 

kb) ^#fCo 

(0 0 5 3] r:^^/u^/uy^ • ;^i;-tf(7>^y 7- 

■^ae^«:^tp2.9-kb BglllKrJt (Biosci. Biotech. B 

iochenL, 60#161-163H. 1996) ^^b^ ^ ^ —-^MBJ- 
cD^-^ ^-i^-mi$.0. 6-kb EcoRV-HindIIIff>lT'^pBlu 
escript h y i^ iy—yft) (^EcoRV-HindIIia5flL(c:f- 
y^D--n>^t. T^^;^^ KpBET (3.6 kb) Sr^fCo 
10 0 5 4] 7"^;^ 5: KpBRJAP:^^bT/Vn • x't K 

*/3-i/yl-^ n^y— ifite^^^tf2.8-kb Pstl-EcoRI 
»fM*^©#b> rtt^y^y:^^ KpBETOPstl-EcoRmffi 

m;^ir5/ h^^tPT'y;^^ FpBAPETG (6.4 kb) 

(0 0 5 51 (2) ±mmAMff-:r/i-^^=^y-^m^ 

J: 9 . 3. 4-kbCC):fcfl&Mft i3 n n>5^^-^ite^ 

n^^'^-:^-tLXmam&^^^1r^-:fy:^^ KpN 
3 (1tFf8¥6-245777) C0PstI-HindIim{4(c:i¥A * 

(D>^y:^^ KpNAPETG (11.6 kb) 5r^^L/ho 

[00 56] (3) :kmm^Mfi-^/^^^:=-'^-'^i&G 

•fy:^% KpNAPETGSrUnkels h<D')j^ (Mol. Gen. Gene 
t. , 218# 99-10431,1989) (Ct^oTT^^^/^^/l^:^ • 
-^niaDj^tattAOLUC^ALfCo #btLfcff^Kte«^ 
fl54-3%ir/W=3— ;?^XUf50 jLtg/ml 5-:/n^-4-:J^nn-3- 
>f K— /l^- j3 'Xi-^JV^ n KSr^tf Czapek-Dox:/ 1/— * 



* h±T3OCT'lO0Fp1j§«Lfci<t =3P=.-;!55#fe^ 
:7'7;^^ KpN3ft3R<^?l^»temtt:|^Wfe^ML;^^;5^o 

[0 0 5 7] Wfe^TF 1 oSrSi^^J^jfe 

10 apek-DoxlSfl6T*8^W««L. #bttfcMft<?5«l«i»att 
i3 -^/l^^ n Jefferson 
(Proc. Natl. Acad. Sci. U.S.A., 83^. 8447-8451, 
1986^) T*9J^Lfctr5. tt;Sttli78 nmol p-:^ hn 
:7:j./-/^/min/mg-seS'^fcofco rtut::«L. pN3 
6 3feff$Kte^^^ < Inmol p-^ Vxiy:x.J — /V/min/m 

[0 0 5 8] 

*^W(-<J:*?. r;^^/i-:af/u;;< .;^y-^^ 

[0 0 5 9] 

\mm\ 

mm^ : 1 

: 1 8 2 9 
i2^J<^ffiS : Genomic DNA 

30 &m 



1#iSSr^i~6E-^ : promoter 



ATCAATTTAA AGGCTATAGA TGGCTAAGAC TTTGGGGGCG ATACAGATCG GAGTGTTGGA 60 

AAGGGGCTTG CTTTCGTTTG ACAGCATAAG CTATCGTGGT GTTACAGCGG TAGCAAACAC 120 

GTGGAACAGG AATCCTTATA HGTCTAGAT CATAGACATG TCAAACATTT CAGACAATCC 180 

CCTTCACTAG CAGACTACAC ACCCAAATGC TATCAATGAA GAAACGGACA GAATTATATG 240 

TACTTCTACA GAGGGTGGAG GTCAGGTGGC TTCCGAAAGG TCCTTCGCCC GTAAGGTACT 300 

CCTGCTAGGC TTTGAAAGTT TTAAAAACCX: CGCTATCTGC CCTCAAAAAC TCTTTTGTCT 360 

TTCTCAATTG ACAGGAGGGT CTGCGCGGAG GATGTTATTG TCTCCGTATC CATGAGTGCG 420 

ATGAGAGAAG ACAGTAATGA TGACGACGAG AACAGAATCA CAAACAAGGT GATCCTACAG 480 

ATGTTACTGA AGACACCATC TGCTCTGGGT CGTAAAATTG TCGCTGACTT TCAGGGAGAA 540 

GTTCAAGTCT TTaTTTATA GCAGGAACTG AACCGAATCA CXXSACaCAT GTTCCATAOG 600 

GaCGCTGGT TAGTCGTCCG GATATGGGGA AGCTGCCGCT GTTAGGTATC AGGATTATAC 660 

ATTAATTTGC GGTGCCTTCC ATATGTTGAG CTCCGTACGT CCGGGGTCAG CATGGGAAGA 720 

ATTAGATGAA AGCGCAGCCA TCCCGCATCT CAGCAAATCC CGAGAAAAAG CAAGCAAGAA 780 

GGAAGGGAGG GGGTTTAACG GAGTAGATGG GGCGAAGTGG CTGCCATGCT GAACTAHAC 840 

GGAGTACATA GCGAGAAGTA CGAACATCCA CCAACACAGA AACGGAACTA GCGGCTCACA 900 
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ATTCCGCCTG AGGCATCCAT GCCATCCTTA AAACTGACTA GGGTTGCCAC TTCCATTGGC 960 
CATCAGTCGA TCGAAATCAT CTGATTTGAA GGGTCTCGGT TGAAACGACA CTACAGTATC 1020 
GAACGGGATT TGCTTaGGA GTCTCCCAGC TTGAGGTCGA GTCTATCTTA HCGGTCGTC 1080 
TATCCXXJAAA TTAAAAGCTT GACGTTTGAC AGGGAACCTT TAAAAAAAAA GAAAAGAAGA 1140 
TCCTTTCTTC TTTCCCCGCA TGAGATACGG CCAATTTTGC TATATTTGGA AAGGCTTGGC 1200 
CCAGGATCTG GACTCACTTG ACAGGGAAGC CnTTCGGGG GAAGGGAAAG TCTGTGACTC 1260 
TGTACGGGAG TACTACTTCG TACTTACCAC mCTACTTA CTACTGACTT GTTAGTTTAA 1320 
AATCGAAGGA AGTGCTTATG CXiCCATCGTA CCGATCGGGC AATGAGCCTA ACCCCGATGA 1380 
CGTCGGCCCA TGGATCTCCT GTTGTTGCCG TTCATTCGGT GGGCTGATAA TCTTCCAATG 1440 
ATCCAAAAAT CXSTTTGATCG TaTGGAATA GaTCGGTGT GTCTCACAGA TTCTCTTTCA 1500 
TGGTTTTTTG ACCTTTCCTT TCCCTCTCTC CGGTGACTCG TTTTaATAT GTCCGTCTAG 1560 
ATGTGCGTGT GTATCACTAC AACACCATTC ACCTTCATTC ACCAaATTA TTCCCACCCT 1620 
ATAATACGAC GTCATGCCTA TGAGATTCCC GGGTTGGATG ACCGTCACCC CACCCATTGC 1680 
TTGGATTCTA GACTTCTATA AGAAaCTCC AGCGTCTCAC CATCAGCTTA CTTCACTTTC 1740 
TCATCACTTC TCTCATCGTG TAAATCAACC CTAACTCCAC CXJCAACCTCC TTTCTTCCCT 1800 
TTCTTTCAAC ACAAGATCCC AAAGTCAAA 1829 



TAAAGGGAAT ACGGAAAACC TGATATAAAC GCCTGCACAC ATGAAGAATC GCCGCAAGTC 60 
TAACCAGCTC ATTCGCCTCA TACCGGAGTG GCTGGTCACA CACCCGGCCC TAAAACAAAA 120 
CACATACTGA CGGCGAMGAG TTACATATCT TGGGAGAACT GCTTATCAGA GCTTAAAGAT 180 
CGCCGAAGTG TTTTCCCCGG AATACCTAGA GACTATGTCA ATATGTGCCG AAAACCGCCT 240 
TTGAAGAATA AAGCATAOGT GATATAAAGA AAGCCATCTT CATCCTTGTG TTCTTCGTAG 300 
ATGCTCaCA TAAGTGCAGC TGTTGGTGGT AGCACCTCGT CGACGAAGAT GAAGATAGCC 360 
TTCTCGGGAG ACAGTTTGAT GCGCTTGCGG ATAACATAGA CGAACTGCCC GACGGTAAGG 420 
TCTGCAGGAA CAAGATACTT CTTCTTATCA ATAGTGGCGA TGTCCGACTT CTCGACTTTC 480 
TCACAGATTA CCTACAAGTG AAACTCGAGT TAGCATTCTA AACAAGCGGT GAGTAGACAT 540 
CTTGaTCCC TGAGGAAGAA CAGCTCAAGG GCAAGAATCA laACCAAAG TAOGGATCAG 600 
ATAGAGTTGT GCGTACTGGA ATGCGATCAG CGTATTTCTG ACGGATACXJC TCGGCTTCTG 660 
CCTTGCGCTT CTCAAAGGGG TGCTCXJTCCT TGAACTTGGA GCGCATATTG ATGGATCGAA 720 
TCAGTTAATG GATACAACTA GGTCGGTTTA CTATGAGTAC TAGGTACATC AACGAGATAG 780 
ACGTAGATGG TGTGAGGTGC AGAGTGATGT AAATGAGATG GTTGGTGTAA TAGGTGAACA 840 
GTAGGAAGAA GGTGGATGAT GACAAAAGGG AGAGCAAAGC CCCGCGTCAT GGGTAGTACC 900 
ACGGCTGATA AGGTCGTCAG GCAAGAGCGG ATCACXK5GGT GGAGGCGGTG AGCAGCCCCA 960 
CACGTGAGTC ACGATCAGCC TTGAGCCTTC AGGTGCCCCA AGTCATCCAT TTAATTTGAT 1020 
TTGATTTATG TCATCTTCAT TAAACAGATA GCTCATGGAC ACACGTCATA TTCTCACGTG 1080 
GGGAGCCTGG GTGATGTCCC GGTGATATCG GGTCTATAGT GCCCTACAAA GATGTACTAC 1140 
TGAACTGAGT AGCTGTGGGT AGATGCGGTG TCCGCCTCCA CCTGATCAGC GGCTAGACTA 1200 
aGGTGCGCG CTGTaCCAT AGAGGGCTCT TGATAGGCTC TGTTCCTGGA ACTTGTAGTC 1260 
GGTATTACAA TCATAGTCAA GTTCCAAGAG TATATATAAT CaAATTCTG TGATGACTCT 1320 
CACTCAACTT AATCTCATTA CAAGCTGCAT TATGACAAGC CAACAACTTT ACATGCATGA 1380 
TCAACTCGAT GGCCAATCCT CAACGCAATG CAAACCAGGC AAACTACTTA TCACAGCGTC 1440 
ATCTGACTAA GCAGGAACTG GGACATGTGC CAAACAATGC AAACAGCGTA TATCCCGGAT 1500 
ATTCTGACGA CGAGTGGTCC CATACAGTTC CAGTCCAGAG AAGGTAACCT CACGATTAGG 1560 
CTACCTCTAT TTCTTGAGAT AGGTGGAGCC GTTTGAGAGA GCGTGCACCC ACTCATATCG 1620 
TTCCGGTACC ACCTGATCAT TCCAiaATC AGCCAATTTT CAATTCCATC TTATAAAACT 1680 
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®) WmW-l 1 -2 7 6 1 7 0 
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ATGACCTTGT CCCGGCAATA TATGCCTCTT CXK5AGGTCAC TCTGCCACTT CTCAGCGCAG 1740 
ACCAGACGCG CACCTTGCAA CTTTCAGAAT ACATTTCGCA GATAaCAGC GGAAAAGCGA 1800 
TCCAGCGAAC CGAAATCTTT TGTCGTGTGG AGGCCATACC TCCGGCTGGC CATTGGTGTC 1860 
CCAHCATCG GAGCTCTGAT ATATTCC 1887 
[ 0 0 6 1 J m^m^ : 3 * S^mcomm : Genomic DNA 

ia5>j60fi$ : 2091 gjjg 

h .i^n : * W&Li:7jkirm^ : promoter 

GCARGTAGAH TAGAACACAT TTGaATCCC GTAGAACCGC CAATAGTAAA AWTTGCGGTG 60 
CCATAAGGTC GTCGAGGAAC GTCGGTTGGT TTCCATGAGA TTGCCGCCGA GCAACAATGC 120 
ACGACCCGGA TTTTGATTTA CTGATTATAG GATCGTTATT ATAGACAGGG AAGGAAAAAG 180 
AAGCAAAAGA CAAGCAAAGA CTATCAAGAG ACATGAAGGC AAAAAAGCTA TACAAACACA 240 
TGATTTAGGT TGCATGATGA CTTCCGTGAC TGTCATAaA CCGTGCAGTG GACCACXJAAG 300 
AGAGTCGCCT AGATGCTCGA CTTACTARAC GGGTTAACCG CTTCTGTTGA TCTGCATGCA 360 
GACTTTTCCC GACTAGACTC AATCCAATAA GGCCGGCGTC CGTACGGCAT GCCGAKATAT 420 
aCXJTGTAGG GTAACGCCGT GGTTGTCACC GGRGGCSGAA ATGGTCGAGT CTGAGAGATT 480 
TTGOGTCGGC AAGTCTGCCG AGTGGGACCA CGGCCAAaC AGGAGAATAG GAAATCTCAG 540 
CCGTTGTTTG TTGAGACTAA AGTTGCACCA KTCAACATCG GCTCTGCTGA AGTCTGCACT 600 
TCTCAACAGT GGTTGGATGA AAATCCGACA GTGGGTCGGA CGCCAAGTGC CTTGATGCGA 660 
CXJATCCACCA GTCGAACXJCA CCAGCAGAAA TTTTTCTCCT AAGTTAGGGC GAASTATTKC 720 
GTATGTGACG TCGAGTTGAC CGAATCGGCA CACCGAGTCA CAAACGGATC ACGNTGCCGA 780 
CTTCTCGGAC GGCCGATAGC TTCGGAACGN TTCCAATGAC ATACATACAC AGCAGAAGAC 840 
ATGCATGGGA CATTCTTGTA GGTATTGTAG TCAGATGATG TTACCTTCCA CCCAATATTG 900 
ATTTGTAACC TTCGTATCCA TATTATTTTT TGAACCAAGA CTACAGGGTA TTGGCACAGA 960 
GTTCGGATGC TGCATGAGTT GACTTTATCG GCATCATTAT TACCTTTCAC TACCCGACTC 1020 
GGGATCGCCC AGAAATTTTT TTTCTTAGGG TGGATCCATT TAGATCTGCC ATGACTCTTG 1080 
GTACCAGCTC GGAACCAGAA AACTTTACTG TGTGTCTTCA GGGGAGCTAT AATTAAATTT 1140 
AGGGCAAGGG GGAAAAAAAA TGAGTGAGGC AGATTTGAAT GGGCCTCGTT AACATCGTTA 1200 
AAACAACTCT GTTGAGAAAG GAGTCAAGGG ATGTAGTGGA TCGTCAAATG TAAACAATTG 1260 
TCCGTAAATA CCCCCGTACA CACACCAACA GAGTAATTGT GCGTGTGAAT CCTGAGAAGT 1320 
ATGTGGCAAA GGAATTAAGC GTGTAAACAG AGTCAGTGAA CTAACTATTA CTACCAC(XT 1380 
GAGACAACAC GGGAAAACAA ACTACGTAGT ATCTACAGTG TGGAATGGAA TAATCGAGAT 1440 
ACTACCATAG GTACTACAAA CTTGAATTTA GAGGATTGGG TAATAAAAAA TAAAATAAAT 1500 
AAAAATAAAA AATAAAAACC AAACCAGACC ATCATGAGAC GCACATGACG TCCCCGTGAC 1560 
TACTTTGGTA GAATTCCAGA CTCTCTCAAG TTCGCTTTTC ACGATAGCAA GACGGCGAGC 1620 
CCGGCAACGA ACCTCAACCC ATCCATGCCA ACGATAACTT TACTCTTTTG GTCGTCTCCA 1680 
GGCACATCCC CACTGTCTCC CCACTCCAAG AGACTCTGTC GAAGCATCCA AGAACACACT 1740 
CCGCCCCCCT AGCATGCGAC TATCCTTCCT GGTTCGGCCC TTGAGATCTG GGCCCCCGTT 1800 
TTTCTTGCCA TGGTCCCTAC CCCGAGAATG GGGTGTCCTC CAATCCCTGG GATGGAGGCG 1860 
ACGTTTTGTC TTGCCGACGT ATGGACCGTT CATGTAAAAC CTTTCATTTA ATTTTCAATT 1920 
AAGCGCACTA AATAATTCCA CGATGCGGTT ATTAAGTTCC GACTCTGGGT TATATATCTC 1980 
CGCGCTTCCC ACGTTGCCTC CCCGGTTTCC TTCTCTTTCT CATCCCATCT TTCATCCaT 2040 
CACCAATCTG TCTAAGGATT CGTTCGACCA TTGAGGAATC CATCCATCAT T 2091 
[0 0 6 2] Smm^ : 4 m^cDmm : Genomic DNA 
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ATTCTATAGA CTTGGTAGAA GTAGAACCTT CCCTGGTCXST ATATGGAGTA CC3GAACGCGG 60 
TGGGCAGTAC ACCAGCAACA CX^ACCAGCGT ACACACGCTC AACAGHCGC GTCGCTAAAG 120 
GCGCGCGACC GCGATAGTGG CATACTTCTT ACAAGAGAAA ACAGCGCGAG ATTAAGTCAC 180 
GACAGTAATG AATTGGATTA TTCGTCATAG AATCTGCATT TCGAATCTAT GCCATCCGGT 240 
GGCAATTCAT AGAGAATACC ACTTGGTCAA GTAAAATTTC TTAATGTACA TGCTCGGATG 300 
CATCaATGA GATCCAATCC ACTCAGCCGC GTCGTCGGCA GCCTACTATT TATTGCGCAG 360 
CAAAACGAAA ACTAATCCCA GTGCCGAAAC CCATTGATAT TCGCAAATGA AAAAAAAGAA 420 
CCCATCATGT ATCGCCAGTC GTGAAAAAGA AATAAAACCC CTAATCGCCT ACCCAGAGAC 480 
AACCAGAAAA ACOGGAaCA TTAGAGAATA ATCTAAGAGC GCTCACCACG GAGACGGCGG 540 
GCGAGCTGGA TGTCCTTGGA CTGGATGGTA ACACGCTTGG CGTGGATGGC GCAGAGGTTG 600 
GTGTCCTCGA AAAGAGAGAC GAGGTAGGCC TCAACGGACT CCTGGAGAGC ACCGATGGCG 660 
GAGGACTGGA AACGGAGGTC GGACTTGAAA TCCTGGGCGA TTTCACGGAC CAGACGCTGG 720 
AAGGGGAGCT TGCGGATCAG AAGCTCGGTG GACTTCTGGT AGCXJACGGAT aCACGGAGA 780 
GCGACGGTAC CTTTTCATGA CTGTTAGTGA TGACGCGATG ATGGTTTGCC ACATCATAAG 840 
ATAACTTACC AGGCTTGTAA CGGTGAGGCT TCTTGACACC TCCGGTAGAG GGAGCAGCCT 900 
TACGGGCXiGC aiGGAGGCG AGCTGCTTAC GAGGGGCCTT GCCACCAGTG GACTTACCTG 960 
GATGGATGTT AGTGATGATG ATTCAAGACG CGATGTATCA CTCATGGGGA CGCGACCCTG 1020 
AOGCGGGGCX; ACGACACCAC AAGAGAAAGG ATAGTAAGGT TAAGACTTAC GGGCAGTCTG 1080 
CTTAGTTCTG GCCATCTTTA GAGATAAGGT TTGATGGATT TAGTCGACTT TGAGATGAAA 1140 
ATAGATAAAA GTCGATGGTT GTCGATGATA ATGTGTTGGC GGATGAGGAG ATGAAAGAAG 1200 
GATGGAGGAG GCGGGTGGGG GAAGGTATTT GTAGTAAGCG AAGGGTGGGT GGTGATCTGA 1260 
TGGCGTTGCC AGCAATTTGG GTGGCCTTAG CGTCGTTTCC CTTTGATCAA AAAAGCGCTC 1320 
GGAAAGCAGC ACACCATAAG CCGTGATTTC GGGGGTCACT TTTGGCTGCA GAATCAAAGG 1380 
GTTACCTTGC GGGCGGTCAA GGAATCCGCG CCCAAGGCAG GAATGAAGTG TGGTCTGTGC 1440 
GTTAAGCGAG TGACGGACCA ATAGAATCGT AGGATCACTG ATCTAGCCTG TCTCCCTACC 1500 
ACXJTCTTGAT CCCAACCACC GGTAAGGTAT GTGAAGTAAT TCAGTAGATC AAAATGAAAG 1560 
TTGGTATGCT CAGGAATATC TCTGGTCAAT TTTGATCCCT GATCTCGGTG GTTCAGAGGC 1620 
ATTCCGTGAG GATCAAAGTT GATCGTAGCG GAATCATCAC TTCTCX}GTCA CGTTGCCGAC 1680 
AGATCAAACC CGCCGGGAGG ATCCTCGTCT CTCATTGGTC GATCCTCTTG TAGCGCGTGA 1740 
TTCCAACGCC CTCCCCGCGC TCTTCCTTTT CCTATACAAA TATCACCCGT CCCATGGCAG 1800 
aGTCCCAAT CTTTCCCTTC CCATCTCTCT CACATCTGTC ATCCAATTAA CCACCTATCT 1860 
CTCCACATTT TCCCACTCAA CCTTAAACAA GTTAACTCTT TAATCAACCA TCAAT 1915 



ATCGGAGGTC TTGGGCTCAA TGAAGTGAAT ATATAGTGGA GGGCTCTCCG CACTACTTTG 60 

TTGAAAGAYA ATCGATACTT GCTCX5ATTTA GTAGTCAAAG CGCCGAACTG AG(XTTTCCA 120 

TGGCAATAGT CAAATGAAAT AACTACTGGA CTAGATCTCA TCATCAAACC ATTGTATGTT 180 

TAGCTCACGG GGATGTTGTT CCTCGGTTAC TACCCGCTTT GTAGAGTCAT GGAGCAGAAT 240 

TCCGAAATGA GTTATAGTGG GCAAAAAACA TTATTCTTAT GTTGCCAGAA AAAAATACTA 300 

CGTTTTTCTC CTTCAACCAT TGTCTTGCGA GTCAACTAGC AGCATATCAG ACTGTGACAG 360 

TAGTCCGTTG CCCTCACCGG CCTATCGGAC GAGGTGTTTT TGTGTTGCTA CCCTTCTGGG 420 

TTTAGTGAGT TGGACCAGAT TTTGGTAAAT GTTGCACGTG ACTCGGGTCA CATGAGGAGC 480 

TCTTCAAAGA GTGCGCTATT CCTGAACAGG TACTCTTGAC GCTAGGCAAT TTCAGGCACA 540 

ACTCAAGGCG CCCGCAAGTG CCCGCTCAGT GGCCTACTCG AAAATACCAA CTCTTTGGGT 600 

GTGGTTCGCT ATAAGGGCTT TCGCCTTTAC ATAGTGACAT GAATAACCAC GGAGTATATA 660 
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GGTCCTTGGC TAATCGTGCG AAATCCAAGT ATCGATTCAA aCCATGATT GTGATAGCTT 720 

GGTAGAGTAG CATAGOGGCA AGAGCAATCT CCGAGATGGC TAATATGAAG AATTTTACAA 780 

GATACTCGGA CATAAGTaC CAGAGGTGAC TTaCAAAGT TCTTTCCTGG AGGaCTTCA 840 

ACGCXJATGCC ATTCTCTTGA CCCCACATTG ATTGAATACX} TCCCCCACAG CTTCACTGGA 900 

ACATGTTGGC TTGTGCCAAG TGAGAGCAAT CGTACCTCAT CTCATCATAT ATGGATATAT 960 

GTCGTATACC TGCTTCCGAA GGTTGAAAGT CAAGGATGGA AAAGAAAAAG GAGGAGAGTT 1020 

CTAGATTCAT GCGGGGGAGT GGTTGTGGTG TTAaAACTT ACTCCGTACT GATCaACGA 1080 

aGCCAGGTG ATATTGATAT GAATATCGAA TGATGATTGG GTGGATGGGG TTTGTTCCGA 1140 

AAGAGGGTTA GTTCGTTGTG GAAGGGTAGA TTTTACGATT GGAAGATAAT GTGGGATAGG 1200 

GCTAACX;CGG CCTGCCTGAA CCCTCAGGCC GGAGCCGGGT GTTACATACT GACCGCCCAG 1260 

CGGCTGAGTT AATGAGAACA TACACATGAT TCGAAGTTTC AGGTCGATGG TGGAGGGGAT 1320 

ACATTCAAAA TTAGTACTTA AGAGAATTAA ATACGAGGCT AATGAGGTTA ACTGCTCTAC 1380 

TTACCCCAGG AGAGGAGGAA AAACTGCACC ATCATCGTTT GGTGGTGTTG GTCATGTGAT 1440 

aiCAATCAC CTGACTCTGA ACCCGTTAAG CGAGACTTGG CAGAAATCGG AATGCACACC 1500 

AAAATTGCCC ATCCTCACAA CCACCOGAAC AACATCGCAG CTTTCGAGGT CGACAACTAA 1560 

TACGCCAAA 1569 



CAGGTATHC CTTTAGTCGG TCCAGCCTTA CCATTTCCCT GCATTGAATG ACAACACTGT 60 

CGCATTTTTT ATACGCCAAT AATAGCCATG GCCATTCTCT CACAACATCA CAGGTCAAGC 120 

AGACTGCAAG TGGTTTCTTG TGGTGGaGT ACTATCAAAA CGATAGAATT CCCATTTGCT 180 

TCGTTGGCGG TAGTAACCTC CACCAATGCT TTGTGGACTT GATGAAGTGT ACACGTCCAC 240 

TGCCTGTCAA AATGGCACGT AATACATTCC AATCCACAGG CTAGAAGGCT TGTGCGGTGT 300 

CCACAGACGA TCAATCCTCA TAGGTTTCTG ACAGTCTCAC CCAGTGTTAG CCGTCGATGC 360 

GGTCTCaiG GTTATGCTTT CATTATCACT CCATATGTCT CTGCTCTCCC TCTCCACTGC 420 

Ga}CATAGCT TGAGCAACCT CCTCAAGCTT AGCTACCACA TCCAATACGC CGTTGTTGAC 480 

CCTATCXJAAA TAGCGATCAT TCGCTTCACG CCTCTCGCGT GCCAGAAGAA TGTCCTTTTC 540 

TGTTTCCTCA TCTAGATTCC TCTGGGAGGC ATCATCCGTT GCGGTTTGTT CTCCCGTGTG 600 

AaAGTGGCA GGAGAGGATG CTCTTGGTCT TGATTGACTC TTGGTAGCTG CATTCTTCGG 660 

ATCAACTACC TCATCAGCTT OGCTTCGAGC CCAGGCCTGG GCAAGCCTCA GCTGCTTTGT 720 

TCGCAGGTCT TCAAGGGCCT CAGAGTGCTT TTCGCCAAAG ACATGTTCAC TAGCTACCGC 780 

ACTCAGCTCG ACCTCTGACA AGGCATTCTG CATTTCTTGC CAGATGGCGT CCGAGCGATC 840 

AAGTTCGCCT TTTGATTGTG TACGAGATAG AAAGGGGGAG AGAGAGCCTG GGGTCTCAAG 900 

GCCGCCTTGC TGGCGCGAGG AACGATTCCG CTGCAGCGCA GGACGCTGAA CiXCTCCACC 960 

GGGAGAAACC GAGGGCACCG GAGGTGTGAG AGATATGTTA GATCCATTAT GGGAAGTGGG 1020 

TGGACCTTGA AAAAGAACAG GGGGAGGGAG ACGCGCAGAG GGAATTTTCC TCAGATTCTC 1080 

TAATTTTGGA GCCX3CATCCA TGGTGTTCGC GAAGTATACG TTGTTGGCAG GCGAACTAAA 1140 

ACAGAACAAG CTaAATTGA CAGCCGTAAA TAAAGGGAGC TTAATCTCCX5 CATACAATCA 1200 

ATGCTGACGG TCACTGATAA GGGGCTTATC GATATCGGAG AACGGCGGAA GTTGTTCCGA 1260 

TCCCGTGCCT CGCTTAGCGG CCGATTAGGG CGGAGCGCGC ACAGCAGAAC ATTTTGGTGA 1320 

GCCTTCAGCG ACACCGCCGG CCGTGAAAGG AACCCTTCGA CTCTCGACAA CCATCCCCAT 1380 

CAGGCCTHC GGCAATCTAT ATCAGTCGGT ATGTATATTG TTGCTGCTAT TTTTTGATGA 1440 

AATATATTCG TGTCTAGGGA CTCCCGTTCA ACATCGAATC TTTTGATTTT TGATTTTCCA 1500 

CCGCCCCGGC CATGTACTTC TGAAATCCAC GTCCCCTCCA CAAATCACGA AATCACCAAG 1560 

CATCAGATTG CCACAGCAAG ATCTAACGCA TTCTTCTTCT ACAGACCGTC CTTCGAACAT 1620 

AATCCGCCAA G 1631 
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